Overview

Dataset statistics

Number of variables39
Number of observations1000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory304.8 KiB
Average record size in memory312.1 B

Variable types

Numeric14
Categorical24
Boolean1

Alerts

policy_bind_date has a high cardinality: 951 distinct values High cardinality
incident_date has a high cardinality: 60 distinct values High cardinality
incident_location has a high cardinality: 1000 distinct values High cardinality
months_as_customer is highly correlated with ageHigh correlation
age is highly correlated with months_as_customerHigh correlation
total_claim_amount is highly correlated with injury_claim and 2 other fieldsHigh correlation
injury_claim is highly correlated with total_claim_amount and 2 other fieldsHigh correlation
property_claim is highly correlated with total_claim_amount and 2 other fieldsHigh correlation
vehicle_claim is highly correlated with total_claim_amount and 2 other fieldsHigh correlation
months_as_customer is highly correlated with ageHigh correlation
age is highly correlated with months_as_customerHigh correlation
total_claim_amount is highly correlated with injury_claim and 2 other fieldsHigh correlation
injury_claim is highly correlated with total_claim_amount and 2 other fieldsHigh correlation
property_claim is highly correlated with total_claim_amount and 2 other fieldsHigh correlation
vehicle_claim is highly correlated with total_claim_amount and 2 other fieldsHigh correlation
months_as_customer is highly correlated with ageHigh correlation
age is highly correlated with months_as_customerHigh correlation
total_claim_amount is highly correlated with injury_claim and 2 other fieldsHigh correlation
injury_claim is highly correlated with total_claim_amount and 1 other fieldsHigh correlation
property_claim is highly correlated with total_claim_amount and 1 other fieldsHigh correlation
vehicle_claim is highly correlated with total_claim_amount and 2 other fieldsHigh correlation
incident_severity is highly correlated with fraud_reportedHigh correlation
collision_type is highly correlated with incident_typeHigh correlation
number_of_vehicles_involved is highly correlated with incident_typeHigh correlation
incident_type is highly correlated with collision_type and 1 other fieldsHigh correlation
auto_make is highly correlated with auto_modelHigh correlation
fraud_reported is highly correlated with incident_severityHigh correlation
auto_model is highly correlated with auto_makeHigh correlation
months_as_customer is highly correlated with ageHigh correlation
age is highly correlated with months_as_customerHigh correlation
incident_type is highly correlated with collision_type and 7 other fieldsHigh correlation
collision_type is highly correlated with incident_type and 7 other fieldsHigh correlation
incident_severity is highly correlated with incident_type and 6 other fieldsHigh correlation
authorities_contacted is highly correlated with incident_type and 5 other fieldsHigh correlation
number_of_vehicles_involved is highly correlated with incident_type and 1 other fieldsHigh correlation
total_claim_amount is highly correlated with incident_type and 6 other fieldsHigh correlation
injury_claim is highly correlated with incident_type and 6 other fieldsHigh correlation
property_claim is highly correlated with incident_type and 6 other fieldsHigh correlation
vehicle_claim is highly correlated with incident_type and 6 other fieldsHigh correlation
auto_make is highly correlated with auto_modelHigh correlation
auto_model is highly correlated with auto_makeHigh correlation
fraud_reported is highly correlated with incident_severityHigh correlation
policy_bind_date is uniformly distributed Uniform
incident_location is uniformly distributed Uniform
policy_number has unique values Unique
incident_location has unique values Unique
umbrella_limit has 798 (79.8%) zeros Zeros
capital-gains has 508 (50.8%) zeros Zeros
capital-loss has 475 (47.5%) zeros Zeros
incident_hour_of_the_day has 52 (5.2%) zeros Zeros
injury_claim has 25 (2.5%) zeros Zeros
property_claim has 19 (1.9%) zeros Zeros

Reproduction

Analysis started2022-02-20 17:39:31.995895
Analysis finished2022-02-20 17:40:19.428214
Duration47.43 seconds
Software versionpandas-profiling v3.1.0
Download configurationconfig.json

Variables

months_as_customer
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct391
Distinct (%)39.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean203.954
Minimum0
Maximum479
Zeros1
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2022-02-20T18:40:19.516567image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile28.9
Q1115.75
median199.5
Q3276.25
95-th percentile429.05
Maximum479
Range479
Interquartile range (IQR)160.5

Descriptive statistics

Standard deviation115.1131744
Coefficient of variation (CV)0.5644075352
Kurtosis-0.4854280674
Mean203.954
Median Absolute Deviation (MAD)80.5
Skewness0.3621768478
Sum203954
Variance13251.04293
MonotonicityNot monotonic
2022-02-20T18:40:19.698343image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1948
 
0.8%
1287
 
0.7%
2547
 
0.7%
1407
 
0.7%
2107
 
0.7%
2307
 
0.7%
2857
 
0.7%
1017
 
0.7%
2396
 
0.6%
1266
 
0.6%
Other values (381)931
93.1%
ValueCountFrequency (%)
01
 
0.1%
13
0.3%
22
0.2%
32
0.2%
43
0.3%
52
0.2%
61
 
0.1%
71
 
0.1%
83
0.3%
92
0.2%
ValueCountFrequency (%)
4792
0.2%
4782
0.2%
4761
0.1%
4752
0.2%
4731
0.1%
4721
0.1%
4681
0.1%
4671
0.1%
4651
0.1%
4641
0.1%

age
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct46
Distinct (%)4.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean38.948
Minimum19
Maximum64
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2022-02-20T18:40:19.896359image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum19
5-th percentile26
Q132
median38
Q344
95-th percentile57
Maximum64
Range45
Interquartile range (IQR)12

Descriptive statistics

Standard deviation9.140286694
Coefficient of variation (CV)0.2346792311
Kurtosis-0.260255015
Mean38.948
Median Absolute Deviation (MAD)6
Skewness0.4789880471
Sum38948
Variance83.54484084
MonotonicityNot monotonic
2022-02-20T18:40:20.067697image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=46)
ValueCountFrequency (%)
4349
 
4.9%
3948
 
4.8%
4145
 
4.5%
3444
 
4.4%
3842
 
4.2%
3042
 
4.2%
3142
 
4.2%
3741
 
4.1%
3339
 
3.9%
4038
 
3.8%
Other values (36)570
57.0%
ValueCountFrequency (%)
191
 
0.1%
201
 
0.1%
216
 
0.6%
221
 
0.1%
237
 
0.7%
2410
 
1.0%
2514
1.4%
2626
2.6%
2724
2.4%
2830
3.0%
ValueCountFrequency (%)
642
 
0.2%
632
 
0.2%
624
 
0.4%
6110
1.0%
609
0.9%
595
 
0.5%
588
0.8%
5716
1.6%
568
0.8%
5514
1.4%

policy_number
Real number (ℝ≥0)

UNIQUE

Distinct1000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean546238.648
Minimum100804
Maximum999435
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2022-02-20T18:40:20.248889image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum100804
5-th percentile143969.6
Q1335980.25
median533135
Q3759099.75
95-th percentile954279.1
Maximum999435
Range898631
Interquartile range (IQR)423119.5

Descriptive statistics

Standard deviation257063.0053
Coefficient of variation (CV)0.4706056706
Kurtosis-1.132637689
Mean546238.648
Median Absolute Deviation (MAD)210974
Skewness0.03899064218
Sum546238648
Variance6.608138868 × 1010
MonotonicityNot monotonic
2022-02-20T18:40:20.467249image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5215851
 
0.1%
6877551
 
0.1%
6744851
 
0.1%
2234041
 
0.1%
9914801
 
0.1%
8042191
 
0.1%
4830881
 
0.1%
1008041
 
0.1%
9418071
 
0.1%
5934661
 
0.1%
Other values (990)990
99.0%
ValueCountFrequency (%)
1008041
0.1%
1014211
0.1%
1045941
0.1%
1061861
0.1%
1068731
0.1%
1071811
0.1%
1082701
0.1%
1088441
0.1%
1093921
0.1%
1100841
0.1%
ValueCountFrequency (%)
9994351
0.1%
9988651
0.1%
9981921
0.1%
9968501
0.1%
9962531
0.1%
9945381
0.1%
9938401
0.1%
9921451
0.1%
9915531
0.1%
9914801
0.1%

policy_bind_date
Categorical

HIGH CARDINALITY
UNIFORM

Distinct951
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2006-01-01
 
3
1992-04-28
 
3
1992-08-05
 
3
1991-12-14
 
2
2004-08-09
 
2
Other values (946)
987 

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique905 ?
Unique (%)90.5%

Sample

1st row2014-10-17
2nd row2006-06-27
3rd row2000-09-06
4th row1990-05-25
5th row2014-06-06

Common Values

ValueCountFrequency (%)
2006-01-013
 
0.3%
1992-04-283
 
0.3%
1992-08-053
 
0.3%
1991-12-142
 
0.2%
2004-08-092
 
0.2%
2010-01-282
 
0.2%
1999-09-292
 
0.2%
2001-09-252
 
0.2%
2000-05-042
 
0.2%
1997-02-032
 
0.2%
Other values (941)977
97.7%

Length

2022-02-20T18:40:20.688645image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2006-01-013
 
0.3%
1992-08-053
 
0.3%
1992-04-283
 
0.3%
1996-09-212
 
0.2%
2014-07-052
 
0.2%
1997-05-152
 
0.2%
1992-04-142
 
0.2%
2003-03-092
 
0.2%
1997-11-072
 
0.2%
1992-01-052
 
0.2%
Other values (941)977
97.7%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

policy_state
Categorical

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
OH
352 
IL
338 
IN
310 

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowOH
2nd rowIN
3rd rowOH
4th rowIL
5th rowIL

Common Values

ValueCountFrequency (%)
OH352
35.2%
IL338
33.8%
IN310
31.0%

Length

2022-02-20T18:40:20.827847image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2022-02-20T18:40:20.921104image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
oh352
35.2%
il338
33.8%
in310
31.0%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

policy_csl
Categorical

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
250/500
351 
100/300
349 
500/1000
300 

Length

Max length8
Median length7
Mean length7.3
Min length7

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row250/500
2nd row250/500
3rd row100/300
4th row250/500
5th row500/1000

Common Values

ValueCountFrequency (%)
250/500351
35.1%
100/300349
34.9%
500/1000300
30.0%

Length

2022-02-20T18:40:21.027077image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2022-02-20T18:40:21.126823image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
250/500351
35.1%
100/300349
34.9%
500/1000300
30.0%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
1000
351 
500
342 
2000
307 

Length

Max length4
Median length4
Mean length3.658
Min length3

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1000
2nd row2000
3rd row2000
4th row2000
5th row1000

Common Values

ValueCountFrequency (%)
1000351
35.1%
500342
34.2%
2000307
30.7%

Length

2022-02-20T18:40:21.225728image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2022-02-20T18:40:21.314017image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
1000351
35.1%
500342
34.2%
2000307
30.7%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

policy_annual_premium
Real number (ℝ≥0)

Distinct991
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1256.40615
Minimum433.33
Maximum2047.59
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2022-02-20T18:40:21.426667image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum433.33
5-th percentile855.112
Q11089.6075
median1257.2
Q31415.695
95-th percentile1653.4435
Maximum2047.59
Range1614.26
Interquartile range (IQR)326.0875

Descriptive statistics

Standard deviation244.167395
Coefficient of variation (CV)0.1943379495
Kurtosis0.07388944021
Mean1256.40615
Median Absolute Deviation (MAD)164.26
Skewness0.004401994527
Sum1256406.15
Variance59617.71676
MonotonicityNot monotonic
2022-02-20T18:40:21.693206image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1558.292
 
0.2%
1215.362
 
0.2%
1362.872
 
0.2%
1073.832
 
0.2%
1389.132
 
0.2%
1074.072
 
0.2%
1374.222
 
0.2%
1524.452
 
0.2%
1281.252
 
0.2%
1230.691
 
0.1%
Other values (981)981
98.1%
ValueCountFrequency (%)
433.331
0.1%
484.671
0.1%
538.171
0.1%
566.111
0.1%
617.111
0.1%
625.081
0.1%
653.661
0.1%
664.861
0.1%
671.011
0.1%
671.921
0.1%
ValueCountFrequency (%)
2047.591
0.1%
1969.631
0.1%
1935.851
0.1%
1927.871
0.1%
1922.841
0.1%
1896.911
0.1%
1878.441
0.1%
1865.831
0.1%
1863.041
0.1%
1861.431
0.1%

umbrella_limit
Real number (ℝ)

ZEROS

Distinct11
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1101000
Minimum-1000000
Maximum10000000
Zeros798
Zeros (%)79.8%
Negative1
Negative (%)0.1%
Memory size7.9 KiB
2022-02-20T18:40:22.067816image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum-1000000
5-th percentile0
Q10
median0
Q30
95-th percentile6000000
Maximum10000000
Range11000000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2297406.598
Coefficient of variation (CV)2.086654494
Kurtosis1.79207731
Mean1101000
Median Absolute Deviation (MAD)0
Skewness1.806712199
Sum1101000000
Variance5.278077077 × 1012
MonotonicityNot monotonic
2022-02-20T18:40:22.203339image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
0798
79.8%
600000057
 
5.7%
500000046
 
4.6%
400000039
 
3.9%
700000029
 
2.9%
300000012
 
1.2%
80000008
 
0.8%
90000005
 
0.5%
20000003
 
0.3%
100000002
 
0.2%
ValueCountFrequency (%)
-10000001
 
0.1%
0798
79.8%
20000003
 
0.3%
300000012
 
1.2%
400000039
 
3.9%
500000046
 
4.6%
600000057
 
5.7%
700000029
 
2.9%
80000008
 
0.8%
90000005
 
0.5%
ValueCountFrequency (%)
100000002
 
0.2%
90000005
 
0.5%
80000008
 
0.8%
700000029
 
2.9%
600000057
 
5.7%
500000046
 
4.6%
400000039
 
3.9%
300000012
 
1.2%
20000003
 
0.3%
0798
79.8%

insured_zip
Real number (ℝ≥0)

Distinct995
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean501214.488
Minimum430104
Maximum620962
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2022-02-20T18:40:22.360714image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum430104
5-th percentile433273.75
Q1448404.5
median466445.5
Q3603251
95-th percentile617463.35
Maximum620962
Range190858
Interquartile range (IQR)154846.5

Descriptive statistics

Standard deviation71701.61094
Coefficient of variation (CV)0.1430557429
Kurtosis-1.190711054
Mean501214.488
Median Absolute Deviation (MAD)21841
Skewness0.8165539259
Sum501214488
Variance5141121012
MonotonicityNot monotonic
2022-02-20T18:40:22.518704image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4776952
 
0.2%
4694292
 
0.2%
4468952
 
0.2%
4312022
 
0.2%
4566022
 
0.2%
4661321
 
0.1%
4522181
 
0.1%
6089821
 
0.1%
4596301
 
0.1%
4531931
 
0.1%
Other values (985)985
98.5%
ValueCountFrequency (%)
4301041
0.1%
4301411
0.1%
4302321
0.1%
4303801
0.1%
4305671
0.1%
4306211
0.1%
4306321
0.1%
4306651
0.1%
4307141
0.1%
4308321
0.1%
ValueCountFrequency (%)
6209621
0.1%
6208691
0.1%
6208191
0.1%
6207571
0.1%
6207371
0.1%
6205071
0.1%
6204931
0.1%
6204731
0.1%
6203581
0.1%
6202071
0.1%

insured_sex
Categorical

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
FEMALE
537 
MALE
463 

Length

Max length6
Median length6
Mean length5.074
Min length4

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMALE
2nd rowMALE
3rd rowFEMALE
4th rowFEMALE
5th rowMALE

Common Values

ValueCountFrequency (%)
FEMALE537
53.7%
MALE463
46.3%

Length

2022-02-20T18:40:22.662609image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2022-02-20T18:40:22.744930image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
female537
53.7%
male463
46.3%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

Distinct7
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
JD
161 
High School
160 
Associate
145 
MD
144 
Masters
143 
Other values (2)
247 

Length

Max length11
Median length7
Mean length5.905
Min length2

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMD
2nd rowMD
3rd rowPhD
4th rowPhD
5th rowAssociate

Common Values

ValueCountFrequency (%)
JD161
16.1%
High School160
16.0%
Associate145
14.5%
MD144
14.4%
Masters143
14.3%
PhD125
12.5%
College122
12.2%

Length

2022-02-20T18:40:22.815256image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2022-02-20T18:40:22.902913image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
jd161
13.9%
high160
13.8%
school160
13.8%
associate145
12.5%
md144
12.4%
masters143
12.3%
phd125
10.8%
college122
10.5%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

Distinct14
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
machine-op-inspct
93 
prof-specialty
85 
tech-support
78 
sales
76 
exec-managerial
76 
Other values (9)
592 

Length

Max length17
Median length14
Mean length13.521
Min length5

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowcraft-repair
2nd rowmachine-op-inspct
3rd rowsales
4th rowarmed-forces
5th rowsales

Common Values

ValueCountFrequency (%)
machine-op-inspct93
 
9.3%
prof-specialty85
 
8.5%
tech-support78
 
7.8%
sales76
 
7.6%
exec-managerial76
 
7.6%
craft-repair74
 
7.4%
transport-moving72
 
7.2%
other-service71
 
7.1%
priv-house-serv71
 
7.1%
armed-forces69
 
6.9%
Other values (4)235
23.5%

Length

2022-02-20T18:40:23.033358image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
machine-op-inspct93
 
9.3%
prof-specialty85
 
8.5%
tech-support78
 
7.8%
sales76
 
7.6%
exec-managerial76
 
7.6%
craft-repair74
 
7.4%
transport-moving72
 
7.2%
other-service71
 
7.1%
priv-house-serv71
 
7.1%
armed-forces69
 
6.9%
Other values (4)235
23.5%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

insured_hobbies
Categorical

Distinct20
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
reading
 
64
exercise
 
57
paintball
 
57
bungie-jumping
 
56
movies
 
55
Other values (15)
711 

Length

Max length14
Median length8
Mean length8.113
Min length4

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowsleeping
2nd rowreading
3rd rowboard-games
4th rowboard-games
5th rowboard-games

Common Values

ValueCountFrequency (%)
reading64
 
6.4%
exercise57
 
5.7%
paintball57
 
5.7%
bungie-jumping56
 
5.6%
movies55
 
5.5%
golf55
 
5.5%
camping55
 
5.5%
kayaking54
 
5.4%
yachting53
 
5.3%
hiking52
 
5.2%
Other values (10)442
44.2%

Length

2022-02-20T18:40:23.180952image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
reading64
 
6.4%
exercise57
 
5.7%
paintball57
 
5.7%
bungie-jumping56
 
5.6%
movies55
 
5.5%
golf55
 
5.5%
camping55
 
5.5%
kayaking54
 
5.4%
yachting53
 
5.3%
hiking52
 
5.2%
Other values (10)442
44.2%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

Distinct6
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
own-child
183 
other-relative
177 
not-in-family
174 
husband
170 
wife
155 

Length

Max length14
Median length9
Mean length9.466
Min length4

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowhusband
2nd rowother-relative
3rd rowown-child
4th rowunmarried
5th rowunmarried

Common Values

ValueCountFrequency (%)
own-child183
18.3%
other-relative177
17.7%
not-in-family174
17.4%
husband170
17.0%
wife155
15.5%
unmarried141
14.1%

Length

2022-02-20T18:40:23.343978image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2022-02-20T18:40:23.535185image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
own-child183
18.3%
other-relative177
17.7%
not-in-family174
17.4%
husband170
17.0%
wife155
15.5%
unmarried141
14.1%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

capital-gains
Real number (ℝ≥0)

ZEROS

Distinct338
Distinct (%)33.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25126.1
Minimum0
Maximum100500
Zeros508
Zeros (%)50.8%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2022-02-20T18:40:23.752840image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q351025
95-th percentile70615
Maximum100500
Range100500
Interquartile range (IQR)51025

Descriptive statistics

Standard deviation27872.18771
Coefficient of variation (CV)1.109292238
Kurtosis-1.276703511
Mean25126.1
Median Absolute Deviation (MAD)0
Skewness0.4788502296
Sum25126100
Variance776858847.6
MonotonicityNot monotonic
2022-02-20T18:40:23.981846image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0508
50.8%
463005
 
0.5%
515004
 
0.4%
685004
 
0.4%
556003
 
0.3%
497003
 
0.3%
517003
 
0.3%
567003
 
0.3%
476003
 
0.3%
440003
 
0.3%
Other values (328)461
46.1%
ValueCountFrequency (%)
0508
50.8%
8001
 
0.1%
100001
 
0.1%
110001
 
0.1%
121001
 
0.1%
128001
 
0.1%
131001
 
0.1%
141001
 
0.1%
161001
 
0.1%
173001
 
0.1%
ValueCountFrequency (%)
1005001
0.1%
988001
0.1%
948001
0.1%
919001
0.1%
907001
0.1%
888001
0.1%
884001
0.1%
878001
0.1%
849001
0.1%
839001
0.1%

capital-loss
Real number (ℝ)

ZEROS

Distinct354
Distinct (%)35.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-26793.7
Minimum-111100
Maximum0
Zeros475
Zeros (%)47.5%
Negative525
Negative (%)52.5%
Memory size7.9 KiB
2022-02-20T18:40:24.186826image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum-111100
5-th percentile-72305
Q1-51500
median-23250
Q30
95-th percentile0
Maximum0
Range111100
Interquartile range (IQR)51500

Descriptive statistics

Standard deviation28104.09669
Coefficient of variation (CV)-1.048906896
Kurtosis-1.3138745
Mean-26793.7
Median Absolute Deviation (MAD)23250
Skewness-0.391471943
Sum-26793700
Variance789840250.6
MonotonicityNot monotonic
2022-02-20T18:40:24.369270image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0475
47.5%
-317005
 
0.5%
-537005
 
0.5%
-503005
 
0.5%
-453004
 
0.4%
-510004
 
0.4%
-328004
 
0.4%
-538004
 
0.4%
-492004
 
0.4%
-314004
 
0.4%
Other values (344)486
48.6%
ValueCountFrequency (%)
-1111001
0.1%
-936001
0.1%
-914001
0.1%
-912001
0.1%
-906001
0.1%
-902001
0.1%
-901001
0.1%
-894001
0.1%
-883001
0.1%
-873001
0.1%
ValueCountFrequency (%)
0475
47.5%
-57001
 
0.1%
-63001
 
0.1%
-85001
 
0.1%
-106001
 
0.1%
-121001
 
0.1%
-132001
 
0.1%
-138001
 
0.1%
-156001
 
0.1%
-157002
 
0.2%

incident_date
Categorical

HIGH CARDINALITY

Distinct60
Distinct (%)6.0%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2015-02-02
 
28
2015-02-17
 
26
2015-01-07
 
25
2015-01-10
 
24
2015-02-04
 
24
Other values (55)
873 

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2015-01-25
2nd row2015-01-21
3rd row2015-02-22
4th row2015-01-10
5th row2015-02-17

Common Values

ValueCountFrequency (%)
2015-02-0228
 
2.8%
2015-02-1726
 
2.6%
2015-01-0725
 
2.5%
2015-01-1024
 
2.4%
2015-02-0424
 
2.4%
2015-01-2424
 
2.4%
2015-01-1923
 
2.3%
2015-01-0822
 
2.2%
2015-01-1321
 
2.1%
2015-01-3021
 
2.1%
Other values (50)762
76.2%

Length

2022-02-20T18:40:24.549383image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2015-02-0228
 
2.8%
2015-02-1726
 
2.6%
2015-01-0725
 
2.5%
2015-02-0424
 
2.4%
2015-01-2424
 
2.4%
2015-01-1024
 
2.4%
2015-01-1923
 
2.3%
2015-01-0822
 
2.2%
2015-01-1321
 
2.1%
2015-01-3021
 
2.1%
Other values (50)762
76.2%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

incident_type
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Multi-vehicle Collision
419 
Single Vehicle Collision
403 
Vehicle Theft
94 
Parked Car
84 

Length

Max length24
Median length23
Mean length21.371
Min length10

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSingle Vehicle Collision
2nd rowVehicle Theft
3rd rowMulti-vehicle Collision
4th rowSingle Vehicle Collision
5th rowVehicle Theft

Common Values

ValueCountFrequency (%)
Multi-vehicle Collision419
41.9%
Single Vehicle Collision403
40.3%
Vehicle Theft94
 
9.4%
Parked Car84
 
8.4%

Length

2022-02-20T18:40:24.699338image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2022-02-20T18:40:24.813056image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
collision822
34.2%
vehicle497
20.7%
multi-vehicle419
17.4%
single403
16.8%
theft94
 
3.9%
parked84
 
3.5%
car84
 
3.5%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

collision_type
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Rear Collision
292 
Side Collision
276 
Front Collision
254 
?
178 

Length

Max length15
Median length14
Mean length11.94
Min length1

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSide Collision
2nd row?
3rd rowRear Collision
4th rowFront Collision
5th row?

Common Values

ValueCountFrequency (%)
Rear Collision292
29.2%
Side Collision276
27.6%
Front Collision254
25.4%
?178
17.8%

Length

2022-02-20T18:40:24.945671image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2022-02-20T18:40:25.062496image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
collision822
45.1%
rear292
 
16.0%
side276
 
15.1%
front254
 
13.9%
178
 
9.8%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

incident_severity
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Minor Damage
354 
Total Loss
280 
Major Damage
276 
Trivial Damage
90 

Length

Max length14
Median length12
Mean length11.62
Min length10

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMajor Damage
2nd rowMinor Damage
3rd rowMinor Damage
4th rowMajor Damage
5th rowMinor Damage

Common Values

ValueCountFrequency (%)
Minor Damage354
35.4%
Total Loss280
28.0%
Major Damage276
27.6%
Trivial Damage90
 
9.0%

Length

2022-02-20T18:40:25.183120image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2022-02-20T18:40:25.291830image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
damage720
36.0%
minor354
17.7%
total280
 
14.0%
loss280
 
14.0%
major276
 
13.8%
trivial90
 
4.5%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

authorities_contacted
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Police
292 
Fire
223 
Other
198 
Ambulance
196 
None
91 

Length

Max length9
Median length5
Mean length5.762
Min length4

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPolice
2nd rowPolice
3rd rowPolice
4th rowPolice
5th rowNone

Common Values

ValueCountFrequency (%)
Police292
29.2%
Fire223
22.3%
Other198
19.8%
Ambulance196
19.6%
None91
 
9.1%

Length

2022-02-20T18:40:25.415501image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2022-02-20T18:40:25.601127image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
police292
29.2%
fire223
22.3%
other198
19.8%
ambulance196
19.6%
none91
 
9.1%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

incident_state
Categorical

Distinct7
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
NY
262 
SC
248 
WV
217 
VA
110 
NC
110 
Other values (2)
53 

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSC
2nd rowVA
3rd rowNY
4th rowOH
5th rowNY

Common Values

ValueCountFrequency (%)
NY262
26.2%
SC248
24.8%
WV217
21.7%
VA110
11.0%
NC110
11.0%
PA30
 
3.0%
OH23
 
2.3%

Length

2022-02-20T18:40:26.007972image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2022-02-20T18:40:26.123318image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
ny262
26.2%
sc248
24.8%
wv217
21.7%
va110
11.0%
nc110
11.0%
pa30
 
3.0%
oh23
 
2.3%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

incident_city
Categorical

Distinct7
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Springfield
157 
Arlington
152 
Columbus
149 
Northbend
145 
Hillsdale
141 
Other values (2)
256 

Length

Max length11
Median length9
Mean length9.287
Min length8

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowColumbus
2nd rowRiverwood
3rd rowColumbus
4th rowArlington
5th rowArlington

Common Values

ValueCountFrequency (%)
Springfield157
15.7%
Arlington152
15.2%
Columbus149
14.9%
Northbend145
14.5%
Hillsdale141
14.1%
Riverwood134
13.4%
Northbrook122
12.2%

Length

2022-02-20T18:40:26.274865image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2022-02-20T18:40:26.392324image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
springfield157
15.7%
arlington152
15.2%
columbus149
14.9%
northbend145
14.5%
hillsdale141
14.1%
riverwood134
13.4%
northbrook122
12.2%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

incident_location
Categorical

HIGH CARDINALITY
UNIFORM
UNIQUE

Distinct1000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
9935 4th Drive
 
1
4214 MLK Ridge
 
1
8548 Cherokee Ridge
 
1
2352 MLK Drive
 
1
9734 2nd Ridge
 
1
Other values (995)
995 

Length

Max length23
Median length14
Mean length14.749
Min length11

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1000 ?
Unique (%)100.0%

Sample

1st row9935 4th Drive
2nd row6608 MLK Hwy
3rd row7121 Francis Lane
4th row6956 Maple Drive
5th row3041 3rd Ave

Common Values

ValueCountFrequency (%)
9935 4th Drive1
 
0.1%
4214 MLK Ridge1
 
0.1%
8548 Cherokee Ridge1
 
0.1%
2352 MLK Drive1
 
0.1%
9734 2nd Ridge1
 
0.1%
3122 Apache Drive1
 
0.1%
9816 Britain St1
 
0.1%
8214 Flute St1
 
0.1%
6259 Lincoln Hwy1
 
0.1%
4492 Andromedia Ave1
 
0.1%
Other values (990)990
99.0%

Length

2022-02-20T18:40:26.549105image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
drive173
 
5.8%
lane171
 
5.7%
ridge171
 
5.7%
st171
 
5.7%
ave161
 
5.4%
hwy153
 
5.1%
4th57
 
1.9%
5th52
 
1.7%
texas47
 
1.6%
francis45
 
1.5%
Other values (961)1799
60.0%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

incident_hour_of_the_day
Real number (ℝ≥0)

ZEROS

Distinct24
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.644
Minimum0
Maximum23
Zeros52
Zeros (%)5.2%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2022-02-20T18:40:26.692776image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q16
median12
Q317
95-th percentile23
Maximum23
Range23
Interquartile range (IQR)11

Descriptive statistics

Standard deviation6.951372928
Coefficient of variation (CV)0.5969918351
Kurtosis-1.192940152
Mean11.644
Median Absolute Deviation (MAD)6
Skewness-0.03558446644
Sum11644
Variance48.32158559
MonotonicityNot monotonic
2022-02-20T18:40:26.833052image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
1754
 
5.4%
353
 
5.3%
052
 
5.2%
2351
 
5.1%
1649
 
4.9%
1346
 
4.6%
1046
 
4.6%
446
 
4.6%
644
 
4.4%
943
 
4.3%
Other values (14)516
51.6%
ValueCountFrequency (%)
052
5.2%
129
2.9%
231
3.1%
353
5.3%
446
4.6%
533
3.3%
644
4.4%
740
4.0%
836
3.6%
943
4.3%
ValueCountFrequency (%)
2351
5.1%
2238
3.8%
2142
4.2%
2034
3.4%
1940
4.0%
1841
4.1%
1754
5.4%
1649
4.9%
1539
3.9%
1443
4.3%

number_of_vehicles_involved
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
1
581 
3
358 
4
 
31
2
 
30

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row3
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1581
58.1%
3358
35.8%
431
 
3.1%
230
 
3.0%

Length

2022-02-20T18:40:26.962822image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2022-02-20T18:40:27.051039image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
1581
58.1%
3358
35.8%
431
 
3.1%
230
 
3.0%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

property_damage
Categorical

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
?
360 
NO
338 
YES
302 

Length

Max length3
Median length2
Mean length1.942
Min length1

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowYES
2nd row?
3rd rowNO
4th row?
5th rowNO

Common Values

ValueCountFrequency (%)
?360
36.0%
NO338
33.8%
YES302
30.2%

Length

2022-02-20T18:40:27.160399image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2022-02-20T18:40:27.262154image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
360
36.0%
no338
33.8%
yes302
30.2%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

bodily_injuries
Categorical

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
0
340 
2
332 
1
328 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row0
3rd row2
4th row1
5th row0

Common Values

ValueCountFrequency (%)
0340
34.0%
2332
33.2%
1328
32.8%

Length

2022-02-20T18:40:27.345566image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2022-02-20T18:40:27.424770image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
0340
34.0%
2332
33.2%
1328
32.8%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

witnesses
Categorical

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
1
258 
2
250 
0
249 
3
243 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row0
3rd row3
4th row2
5th row1

Common Values

ValueCountFrequency (%)
1258
25.8%
2250
25.0%
0249
24.9%
3243
24.3%

Length

2022-02-20T18:40:27.502816image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2022-02-20T18:40:27.577244image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
1258
25.8%
2250
25.0%
0249
24.9%
3243
24.3%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
?
343 
NO
343 
YES
314 

Length

Max length3
Median length2
Mean length1.971
Min length1

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowYES
2nd row?
3rd rowNO
4th rowNO
5th rowNO

Common Values

ValueCountFrequency (%)
?343
34.3%
NO343
34.3%
YES314
31.4%

Length

2022-02-20T18:40:27.666077image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category

Pie chart

2022-02-20T18:40:27.748326image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
ValueCountFrequency (%)
343
34.3%
no343
34.3%
yes314
31.4%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

total_claim_amount
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct763
Distinct (%)76.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean52761.94
Minimum100
Maximum114920
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2022-02-20T18:40:27.840113image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum100
5-th percentile4320
Q141812.5
median58055
Q370592.5
95-th percentile88413
Maximum114920
Range114820
Interquartile range (IQR)28780

Descriptive statistics

Standard deviation26401.53319
Coefficient of variation (CV)0.5003897353
Kurtosis-0.4540814267
Mean52761.94
Median Absolute Deviation (MAD)13855
Skewness-0.5945819885
Sum52761940
Variance697040954.8
MonotonicityNot monotonic
2022-02-20T18:40:27.992850image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
594005
 
0.5%
26404
 
0.4%
704004
 
0.4%
43204
 
0.4%
442004
 
0.4%
754004
 
0.4%
606004
 
0.4%
31904
 
0.4%
585004
 
0.4%
702904
 
0.4%
Other values (753)959
95.9%
ValueCountFrequency (%)
1001
 
0.1%
19201
 
0.1%
21601
 
0.1%
22501
 
0.1%
24001
 
0.1%
25201
 
0.1%
26404
0.4%
27002
0.2%
28001
 
0.1%
28601
 
0.1%
ValueCountFrequency (%)
1149201
0.1%
1123201
0.1%
1084801
0.1%
1080301
0.1%
1079001
0.1%
1058201
0.1%
1050401
0.1%
1046101
0.1%
1035601
0.1%
1018601
0.1%

injury_claim
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct638
Distinct (%)63.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7433.42
Minimum0
Maximum21450
Zeros25
Zeros (%)2.5%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2022-02-20T18:40:28.146357image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile450
Q14295
median6775
Q311305
95-th percentile15662
Maximum21450
Range21450
Interquartile range (IQR)7010

Descriptive statistics

Standard deviation4880.951853
Coefficient of variation (CV)0.6566226385
Kurtosis-0.7630870611
Mean7433.42
Median Absolute Deviation (MAD)3705
Skewness0.2648108785
Sum7433420
Variance23823690.99
MonotonicityNot monotonic
2022-02-20T18:40:28.284639image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
025
 
2.5%
6407
 
0.7%
4807
 
0.7%
6605
 
0.5%
5805
 
0.5%
135205
 
0.5%
11805
 
0.5%
8605
 
0.5%
63405
 
0.5%
7805
 
0.5%
Other values (628)926
92.6%
ValueCountFrequency (%)
025
2.5%
101
 
0.1%
2201
 
0.1%
2501
 
0.1%
2802
 
0.2%
2901
 
0.1%
3003
 
0.3%
3302
 
0.2%
3501
 
0.1%
3601
 
0.1%
ValueCountFrequency (%)
214501
0.1%
213301
0.1%
207001
0.1%
190201
0.1%
185201
0.1%
182201
0.1%
181801
0.1%
180801
0.1%
180001
0.1%
178801
0.1%

property_claim
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct626
Distinct (%)62.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7399.57
Minimum0
Maximum23670
Zeros19
Zeros (%)1.9%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2022-02-20T18:40:28.432570image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile450
Q14445
median6750
Q310885
95-th percentile15540
Maximum23670
Range23670
Interquartile range (IQR)6440

Descriptive statistics

Standard deviation4824.726179
Coefficient of variation (CV)0.6520279122
Kurtosis-0.3763863117
Mean7399.57
Median Absolute Deviation (MAD)3290
Skewness0.3781687764
Sum7399570
Variance23277982.7
MonotonicityNot monotonic
2022-02-20T18:40:28.625868image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
019
 
1.9%
8606
 
0.6%
4805
 
0.5%
6605
 
0.5%
100005
 
0.5%
6405
 
0.5%
6505
 
0.5%
110805
 
0.5%
8404
 
0.4%
53104
 
0.4%
Other values (616)937
93.7%
ValueCountFrequency (%)
019
1.9%
201
 
0.1%
2401
 
0.1%
2501
 
0.1%
2601
 
0.1%
2803
 
0.3%
2902
 
0.2%
3003
 
0.3%
3203
 
0.3%
3301
 
0.1%
ValueCountFrequency (%)
236701
0.1%
218101
0.1%
216301
0.1%
215801
0.1%
212401
0.1%
205501
0.1%
203101
0.1%
202801
0.1%
199501
0.1%
196501
0.1%

vehicle_claim
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct726
Distinct (%)72.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37928.95
Minimum70
Maximum79560
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2022-02-20T18:40:28.800860image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum70
5-th percentile3273.5
Q130292.5
median42100
Q350822.5
95-th percentile63094.5
Maximum79560
Range79490
Interquartile range (IQR)20530

Descriptive statistics

Standard deviation18886.25289
Coefficient of variation (CV)0.4979376675
Kurtosis-0.4465729231
Mean37928.95
Median Absolute Deviation (MAD)9840
Skewness-0.6210979312
Sum37928950
Variance356690548.3
MonotonicityNot monotonic
2022-02-20T18:40:28.970204image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
50407
 
0.7%
33606
 
0.6%
520805
 
0.5%
47205
 
0.5%
36005
 
0.5%
448005
 
0.5%
336005
 
0.5%
427204
 
0.4%
415804
 
0.4%
350004
 
0.4%
Other values (716)950
95.0%
ValueCountFrequency (%)
701
0.1%
14402
0.2%
16802
0.2%
17501
0.1%
17601
0.1%
18001
0.1%
19602
0.2%
19801
0.1%
20301
0.1%
20801
0.1%
ValueCountFrequency (%)
795601
0.1%
777601
0.1%
776702
0.2%
764001
0.1%
760001
0.1%
756001
0.1%
755301
0.1%
747901
0.1%
736201
0.1%
732601
0.1%

auto_make
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct14
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Saab
80 
Dodge
80 
Suburu
80 
Nissan
78 
Chevrolet
76 
Other values (9)
606 

Length

Max length10
Median length6
Mean length5.703
Min length3

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSaab
2nd rowMercedes
3rd rowDodge
4th rowChevrolet
5th rowAccura

Common Values

ValueCountFrequency (%)
Saab80
 
8.0%
Dodge80
 
8.0%
Suburu80
 
8.0%
Nissan78
 
7.8%
Chevrolet76
 
7.6%
Ford72
 
7.2%
BMW72
 
7.2%
Toyota70
 
7.0%
Audi69
 
6.9%
Accura68
 
6.8%
Other values (4)255
25.5%

Length

2022-02-20T18:40:29.304668image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
saab80
 
8.0%
dodge80
 
8.0%
suburu80
 
8.0%
nissan78
 
7.8%
chevrolet76
 
7.6%
ford72
 
7.2%
bmw72
 
7.2%
toyota70
 
7.0%
audi69
 
6.9%
accura68
 
6.8%
Other values (4)255
25.5%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

auto_model
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct39
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
RAM
 
43
Wrangler
 
42
A3
 
37
Neon
 
37
MDX
 
36
Other values (34)
805 

Length

Max length14
Median length5
Mean length5.178
Min length2

Characters and Unicode

Total characters0
Distinct characters0
Distinct categories0 ?
Distinct scripts0 ?
Distinct blocks0 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row92x
2nd rowE400
3rd rowRAM
4th rowTahoe
5th rowRSX

Common Values

ValueCountFrequency (%)
RAM43
 
4.3%
Wrangler42
 
4.2%
A337
 
3.7%
Neon37
 
3.7%
MDX36
 
3.6%
Jetta35
 
3.5%
Passat33
 
3.3%
A532
 
3.2%
Legacy32
 
3.2%
Pathfinder31
 
3.1%
Other values (29)642
64.2%

Length

2022-02-20T18:40:29.427375image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
ram43
 
4.1%
wrangler42
 
4.0%
a337
 
3.5%
neon37
 
3.5%
mdx36
 
3.5%
jetta35
 
3.4%
passat33
 
3.2%
a532
 
3.1%
legacy32
 
3.1%
pathfinder31
 
3.0%
Other values (31)685
65.7%

Most occurring characters

ValueCountFrequency (%)
No values found.

Most occurring categories

ValueCountFrequency (%)
No values found.

Most frequent character per category

Most occurring scripts

ValueCountFrequency (%)
No values found.

Most frequent character per script

Most occurring blocks

ValueCountFrequency (%)
No values found.

Most frequent character per block

auto_year
Real number (ℝ≥0)

Distinct21
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2005.103
Minimum1995
Maximum2015
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2022-02-20T18:40:29.604603image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Quantile statistics

Minimum1995
5-th percentile1995
Q12000
median2005
Q32010
95-th percentile2014
Maximum2015
Range20
Interquartile range (IQR)10

Descriptive statistics

Standard deviation6.015860835
Coefficient of variation (CV)0.003000275215
Kurtosis-1.171867756
Mean2005.103
Median Absolute Deviation (MAD)5
Skewness-0.04828880711
Sum2005103
Variance36.19058158
MonotonicityNot monotonic
2022-02-20T18:40:29.787937image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
199556
 
5.6%
199955
 
5.5%
200554
 
5.4%
200653
 
5.3%
201153
 
5.3%
200752
 
5.2%
200351
 
5.1%
200950
 
5.0%
201050
 
5.0%
201349
 
4.9%
Other values (11)477
47.7%
ValueCountFrequency (%)
199556
5.6%
199637
3.7%
199746
4.6%
199840
4.0%
199955
5.5%
200042
4.2%
200142
4.2%
200249
4.9%
200351
5.1%
200439
3.9%
ValueCountFrequency (%)
201547
4.7%
201444
4.4%
201349
4.9%
201246
4.6%
201153
5.3%
201050
5.0%
200950
5.0%
200845
4.5%
200752
5.2%
200653
5.3%

fraud_reported
Boolean

HIGH CORRELATION
HIGH CORRELATION

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
False
753 
True
247 
ValueCountFrequency (%)
False753
75.3%
True247
 
24.7%
2022-02-20T18:40:29.914011image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Interactions

2022-02-20T18:40:14.438876image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:36.134824image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:38.677044image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:41.546978image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:44.607433image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:48.101206image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:50.941619image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:53.516104image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:56.341125image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:59.120269image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:02.016824image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:04.866949image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:07.877213image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:11.521548image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:14.632711image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:36.260487image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:38.855546image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:41.712828image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:44.945759image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:48.290493image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:51.112957image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:53.698246image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:56.689091image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:59.308155image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:02.170410image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:05.096263image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:08.070880image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:11.750147image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:14.820917image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:36.382193image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:39.017586image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:41.860981image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:45.262787image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:48.478299image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:51.276845image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:53.866255image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:56.848713image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:59.489282image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:02.313217image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:05.337230image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:08.263439image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:11.956202image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:14.997529image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:36.516801image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:39.203615image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:42.011335image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:45.595840image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:48.663330image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:51.454204image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:54.036641image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:57.002111image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:59.666993image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:02.466434image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:05.589567image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:08.455952image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:12.155020image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:15.228381image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:36.663410image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:39.408740image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:42.185733image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:45.951060image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:48.853607image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:51.641157image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:54.224154image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:57.166991image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:59.859108image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:02.643456image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:05.857485image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:08.691339image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:12.344366image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:15.480154image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:36.796056image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:39.656647image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:42.374791image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:46.202712image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:49.028913image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:51.807753image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:54.416024image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:57.329957image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:00.122545image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:02.982105image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:06.099682image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:08.892012image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:12.526811image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:15.733923image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:36.946948image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:39.885941image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:42.566464image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:46.467308image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:49.212208image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:51.972646image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:54.611614image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:57.503719image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:00.388736image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:03.161884image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:06.355655image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:09.096212image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:12.738192image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:15.952269image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:37.251839image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:40.086641image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:42.736852image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:46.668234image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:49.380386image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:52.122744image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:54.809158image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:57.673400image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:00.607134image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:03.325816image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:06.562392image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:09.279720image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:12.918001image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:16.150301image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:37.423378image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:40.295547image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:42.921357image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:46.843085image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:49.553393image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:52.304295image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:55.025871image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:57.855283image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:00.826468image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:03.508045image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:06.764585image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:09.502267image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:13.089357image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:16.336850image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:37.598907image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:40.487152image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:43.095506image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:47.029063image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:49.775940image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:52.489531image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:55.242140image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:58.039109image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:01.075397image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:03.678280image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:06.953539image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:09.926537image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:13.263408image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:16.549790image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:37.806352image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:40.734927image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:43.306768image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:47.223806image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:49.991365image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:52.715537image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:55.471688image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:58.224940image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:01.281627image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:03.853697image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:07.153477image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:10.181659image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:13.483281image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:16.929003image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:38.010905image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:40.923324image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:43.753178image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:47.408593image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:50.191612image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:52.905845image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:55.686310image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:58.455123image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:01.456668image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:04.132563image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:07.323098image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:10.428163image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:13.775575image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:17.108356image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:38.205285image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:41.126845image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:44.029740image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:47.616473image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:50.388727image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:53.098137image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:55.909538image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:58.658251image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:01.639475image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:04.402334image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:07.476755image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:10.810546image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:13.993555image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:17.290665image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:38.453626image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:41.341824image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:44.307182image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:47.852642image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:50.757458image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:53.305861image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:56.120477image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:39:58.889616image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:01.836693image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:04.647309image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:07.681349image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:11.232656image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
2022-02-20T18:40:14.213678image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Correlations

2022-02-20T18:40:30.026112image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
2022-02-20T18:40:30.343502image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
2022-02-20T18:40:30.701860image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
2022-02-20T18:40:31.089823image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.
2022-02-20T18:40:31.496364image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

2022-02-20T18:40:17.757829image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
A simple visualization of nullity by column.
2022-02-20T18:40:19.165673image/svg+xmlMatplotlib v3.5.1, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

months_as_customeragepolicy_numberpolicy_bind_datepolicy_statepolicy_cslpolicy_deductablepolicy_annual_premiumumbrella_limitinsured_zipinsured_sexinsured_education_levelinsured_occupationinsured_hobbiesinsured_relationshipcapital-gainscapital-lossincident_dateincident_typecollision_typeincident_severityauthorities_contactedincident_stateincident_cityincident_locationincident_hour_of_the_daynumber_of_vehicles_involvedproperty_damagebodily_injurieswitnessespolice_report_availabletotal_claim_amountinjury_claimproperty_claimvehicle_claimauto_makeauto_modelauto_yearfraud_reported
0328485215852014-10-17OH250/50010001406.910466132MALEMDcraft-repairsleepinghusband5330002015-01-25Single Vehicle CollisionSide CollisionMajor DamagePoliceSCColumbus9935 4th Drive51YES12YES7161065101302052080Saab92x2004Y
1228423428682006-06-27IN250/50020001197.225000000468176MALEMDmachine-op-inspctreadingother-relative002015-01-21Vehicle Theft?Minor DamagePoliceVARiverwood6608 MLK Hwy81?00?50707807803510MercedesE4002007Y
2134296876982000-09-06OH100/30020001413.145000000430632FEMALEPhDsalesboard-gamesown-child3510002015-02-22Multi-vehicle CollisionRear CollisionMinor DamagePoliceNYColumbus7121 Francis Lane73NO23NO346507700385023100DodgeRAM2007N
3256412278111990-05-25IL250/50020001415.746000000608117FEMALEPhDarmed-forcesboard-gamesunmarried48900-624002015-01-10Single Vehicle CollisionFront CollisionMajor DamagePoliceOHArlington6956 Maple Drive51?12NO634006340634050720ChevroletTahoe2014Y
4228443674552014-06-06IL500/100010001583.916000000610706MALEAssociatesalesboard-gamesunmarried66000-460002015-02-17Vehicle Theft?Minor DamageNoneNYArlington3041 3rd Ave201NO01NO650013006504550AccuraRSX2009N
5256391045942006-10-12OH250/50010001351.100478456FEMALEPhDtech-supportbungie-jumpingunmarried002015-01-02Multi-vehicle CollisionRear CollisionMajor DamageFireSCArlington8973 Washington St193NO02NO641006410641051280Saab952003Y
6137344139782000-06-04IN250/50010001333.350441716MALEPhDprof-specialtyboard-gameshusband0-770002015-01-13Multi-vehicle CollisionFront CollisionMinor DamagePoliceNYSpringfield5846 Weaver Drive03?00?7865021450715050050NissanPathfinder2012N
7165374290271990-02-03IL100/30010001137.030603195MALEAssociatetech-supportbase-jumpingunmarried002015-02-27Multi-vehicle CollisionFront CollisionTotal LossPoliceVAColumbus3525 3rd Hwy233?22YES515909380938032830AudiA52015N
827334856651997-02-05IL100/3005001442.990601734FEMALEPhDother-servicegolfown-child002015-01-30Single Vehicle CollisionFront CollisionTotal LossPoliceWVArlington4872 Rock Ridge211NO11YES277002770277022160ToyotaCamry2012N
9212426365502011-07-25IL100/3005001315.680600983MALEPhDpriv-house-servcampingwife0-393002015-01-05Single Vehicle CollisionRear CollisionTotal LossOtherNCHillsdale3066 Francis Ave141NO21?423004700470032900Saab92x1996N

Last rows

months_as_customeragepolicy_numberpolicy_bind_datepolicy_statepolicy_cslpolicy_deductablepolicy_annual_premiumumbrella_limitinsured_zipinsured_sexinsured_education_levelinsured_occupationinsured_hobbiesinsured_relationshipcapital-gainscapital-lossincident_dateincident_typecollision_typeincident_severityauthorities_contactedincident_stateincident_cityincident_locationincident_hour_of_the_daynumber_of_vehicles_involvedproperty_damagebodily_injurieswitnessespolice_report_availabletotal_claim_amountinjury_claimproperty_claimvehicle_claimauto_makeauto_modelauto_yearfraud_reported
990286436631901994-02-05IL100/3005001564.433000000477644FEMALEMDprof-specialtymoviesunmarried77500-328002015-01-31Single Vehicle CollisionRear CollisionMinor DamageFireNYNorthbrook4755 1st St181?22YES342903810381026670JeepGrand Cherokee2013N
991257441093922006-07-12OH100/30010001280.880433981MALEMDother-servicebasketballother-relative59400-322002015-02-06Single Vehicle CollisionRear CollisionTotal LossOtherWVRiverwood5312 Francis Ridge211NO01NO469800522041760AccuraTL2002N
99294262152782007-10-24IN100/300500722.660433696MALEMDexec-managerialcampinghusband5030002015-01-23Multi-vehicle CollisionFront CollisionMajor DamageFireOHSpringfield1705 Weaver St63YES12YES367003670734025690NissanPathfinder2010N
993124286745702001-12-08OH250/50010001235.140443567MALEMDexec-managerialcampinghusband0-321002015-02-17Multi-vehicle CollisionSide CollisionTotal LossOtherOHHillsdale1643 Washington Hwy203?01?602006020602048160VolkswagenPassat2012N
994141306814862007-03-24IN500/100010001347.040430665MALEHigh Schoolsalesbungie-jumpingown-child0-821002015-01-22Parked Car?Minor DamageNoneSCNorthbend6516 Solo Drive61?12YES648054010804860HondaCivic1996N
9953389418511991-07-16OH500/100010001310.800431289FEMALEMasterscraft-repairpaintballunmarried002015-02-22Single Vehicle CollisionFront CollisionMinor DamageFireNCNorthbrook6045 Andromedia St201YES01?8720017440872061040HondaAccord2006N
996285411869342014-01-05IL100/30010001436.790608177FEMALEPhDprof-specialtysleepingwife7090002015-01-24Single Vehicle CollisionRear CollisionMajor DamageFireSCNorthbend3092 Texas Drive231YES23?108480180801808072320VolkswagenPassat2015N
997130349185162003-02-17OH250/5005001383.493000000442797FEMALEMastersarmed-forcesbungie-jumpingother-relative3510002015-01-23Multi-vehicle CollisionSide CollisionMinor DamagePoliceNCArlington7629 5th St43?23YES675007500750052500SuburuImpreza1996N
998458625339402011-11-18IL500/100020001356.925000000441714MALEAssociatehandlers-cleanersbase-jumpingwife002015-02-26Single Vehicle CollisionRear CollisionMajor DamageOtherNYArlington6128 Elm Lane21?01YES469805220522036540AudiA51998N
999456605560801996-11-11OH250/5001000766.190612260FEMALEAssociatesaleskayakinghusband002015-02-26Parked Car?Minor DamagePoliceWVColumbus1416 Cherokee Ridge61?03?50604609203680MercedesE4002007N